智能论文笔记

Partial Information Decomposition Reveals the Structure of Neural Representations

David A. Ehrlich , Andreas C. Schneider , Michael Wibral , Viola Priesemann , Abdullah Makkeh

分类：人工智能 | 机器学习 | (统计)机器学习

2022-09-21

在神经网络中，与任务相关的信息由神经元组共同表示。但是，对信息分布在单个神经元之间的特定方式尚不清楚：虽然部分只能从特定的单个神经元中获得，但其他部分是由多个神经元冗余或协同携带的。我们展示了部分信息分解（PID）是信息理论的最新扩展，可以解散这些贡献。由此，我们介绍了“代表性复杂性”的度量，该量度量化了访问跨多个神经元信息的难度。我们展示了这种复杂性如何直接适用于较小的层。对于较大的层，我们提出了子采样和粗粒程序，并证明了后者的相应边界。从经验上讲，为了量化解决MNIST任务的深度神经网络，我们观察到，代表性复杂性通过连续的隐藏层和过度训练都会降低。总体而言，我们建议代表性复杂性作为分析神经表示结构的原则且可解释的摘要统计量。

translated by 谷歌翻译

Three-dimensional micro-structurally informed in silico myocardium -- towards virtual imaging trials in cardiac diffusion weighted MRI

Mojtaba Lashgari , Nishant Ravikumar , Irvin Teh , Jing-Rebecca Li , David L. Buckley , Jurgen E. Schneider , Alejandro F. Frangi

分类：计算机视觉

2022-08-22

在硅组织模型中，可以评估磁共振成像的定量模型。这包括对成像生物标志物和组织微结构参数的验证和灵敏度分析。我们提出了一种新的方法来生成心肌微结构的现实数值幻影。我们扩展了以前的研究，该研究考虑了心肌细胞的变异性，心肌细胞（插入式椎间盘）之间的水交换，心肌微结构混乱和四个钣金方向。在该方法的第一阶段，心肌细胞和钣金是通过考虑心肌到骨膜细胞连接的形状变异性和插入式椎间盘而产生的。然后，将薄板汇总和定向在感兴趣的方向上。我们的形态计量学研究表明，数值和真实（文献）心肌细胞数据的体积，长度以及一级和次要轴的分布之间没有显着差异（$ p> 0.01 $）。结构相关性分析证实了硅内组织与实际组织的混乱类别相同。此外，心肌细胞的模拟螺旋角（HA）和输入HA（参考值）之间的绝对角度差（$ 4.3^\ Circ \ PM 3.1^\ Circ $）与所测量HA之间的绝对角差有很好的一致性使用实验性心脏扩散张量成像（CDTI）和组织学（参考值）（Holmes等，2000）（$ 3.7^\ Circ \ PM6.4^\ Circ $）和（Scollan等，1998）（$ 4.9） ^\ circ \ pm 14.6^\ circ $）。使用结构张量成像（黄金标准）和实验性CDTI，输入和模拟CDTI的特征向量和模拟CDTI的角度之间的角度距离小于测量角度之间的角度距离。这些结果证实，所提出的方法比以前的研究可以为心肌产生更丰富的数值幻象。

translated by 谷歌翻译

ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization

Zdravko Marinov , Alina Roitberg , David Schneider , Rainer Stiefelhagen

分类：计算机视觉

2022-08-19

在设计多模式系统时，模态选择是一个重要的步骤，尤其是在跨域活动识别的情况下，因为某些模态比其他模式更适合域移动。但是，仅选择具有积极贡献的方式需要系统的方法。我们通过提出一种无监督的模态选择方法（ModSelect）来解决此问题，该方法不需要任何地面真相标签。我们确定多个单峰分类器的预测与它们的嵌入之间的域差异之间的相关性。然后，我们系统地计算模态选择阈值，该阈值仅选择具有较高相关性和低域差异的模态。我们在实验中表明，我们的方法ModSelect仅选择具有积极贡献的模态，并始终提高合成到现实域的适应基准的性能，从而缩小域间隙。

translated by 谷歌翻译

Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living

Zdravko Marinov , David Schneider , Alina Roitberg , Rainer Stiefelhagen

分类：计算机视觉

2022-08-03

在活动识别模型的现实应用应用中，域移动（例如外观变化）是一个关键挑战，范围从辅助机器人和智能家居到智能车辆的驾驶员观察。例如，虽然模拟是一种经济数据收集的绝佳方式，但合成到现实的域转移导致识别日常生活活动（ADL）的精度下降> 60％。我们应对这一挑战，并引入了一个活动域生成框架，该框架从视频培训数据推断出的不同现有活动方式（源域）中创建了新颖的ADL外观（新域）。我们的框架计算人体姿势，人体关节的热图和光流图，并将它们与原始RGB视频一起使用，以了解源域的本质，以生成全新的ADL域。通过最大化现有源外观和生成的新颖外观之间的距离，同时确保通过额外的分类损失保留活动的语义，可以优化该模型。虽然源数据多模态在此设计中是一个重要的概念，但我们的设置不依赖于多传感器设置（即，仅从单个视频中推断出所有源模式。）然后将新创建的活动域集成到训练中。 ADL分类网络，导致模型不太容易受到数据分布的变化的影响。对合成基准的SIMS4Action进行的广泛实验证明了域产生范式对跨域ADL识别的潜力，从而设定了新的最新结果。我们的代码可在https://github.com/zrrrrr1997/syn2real_dg上公开获得

translated by 谷歌翻译

Affect-DML: Context-Aware One-Shot Recognition of Human Affect using Deep Metric Learning

Kunyu Peng , Alina Roitberg , David Schneider , Marios Koulakis , Kailun Yang , Rainer Stiefelhagen

分类：计算机视觉

2021-11-30

人类影响识别是一个完善的研究领域，具有许多应用，例如心理护理，但现有方法认为所有兴趣情绪都是先验的作为注释培训例子。然而，通过新颖的心理学理论的人类情感谱的粒度和改进的上升和改善语境中的情绪增加给数据收集和标签工作带来了相当大的压力。在本文中，我们在语境中概念化了对情绪的一次性识别 - 一种新的问题，旨在识别来自单个支持样品的更精细粒子水平的人体影响。为了解决这项具有挑战性的任务，我们遵循深度度量学习范例，并引入多模态情绪嵌入方法，通过利用人类外观的互补信息和通过语义分割网络获得的语义场景上下文来最大限度地减少相同情绪嵌入的距离。我们上下文感知模型的所有流都使用加权三态丢失和加权交叉熵损失来共同优化。我们对适应我们单次识别问题的Demotic DataSet的分类和数值情感识别任务进行了彻底的实验，揭示了从单一示例中分类人类影响是一项艰巨的任务。尽管如此，我们模型的所有变体都明显优于随机基线，同时利用语义场景上下文一致地提高了学习的表示，在一次射击情感识别中设置最先进的结果。为了促进对人类影响国家的更普遍表示的研究，我们将在https://github.com/kpeng9510/affect-dml下公开向社区公开提供我们的基准和模型。

translated by 谷歌翻译

Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning

Thanh Le-Cong , Duc-Minh Luong , Xuan Bach D. Le , David Lo , Nhat-Hoa Tran , Bui Quang-Huy , Quyet-Thang Huynh

分类：机器学习

2023-01-03

In this paper, we propose a novel technique, namely INVALIDATOR, to automatically assess the correctness of APR-generated patches via semantic and syntactic reasoning. INVALIDATOR reasons about program semantic via program invariants while it also captures program syntax via language semantic learned from large code corpus using the pre-trained language model. Given a buggy program and the developer-patched program, INVALIDATOR infers likely invariants on both programs. Then, INVALIDATOR determines that a APR-generated patch overfits if: (1) it violates correct specifications or (2) maintains errors behaviors of the original buggy program. In case our approach fails to determine an overfitting patch based on invariants, INVALIDATOR utilizes a trained model from labeled patches to assess patch correctness based on program syntax. The benefit of INVALIDATOR is three-fold. First, INVALIDATOR is able to leverage both semantic and syntactic reasoning to enhance its discriminant capability. Second, INVALIDATOR does not require new test cases to be generated but instead only relies on the current test suite and uses invariant inference to generalize the behaviors of a program. Third, INVALIDATOR is fully automated. We have conducted our experiments on a dataset of 885 patches generated on real-world programs in Defects4J. Experiment results show that INVALIDATOR correctly classified 79% overfitting patches, accounting for 23% more overfitting patches being detected by the best baseline. INVALIDATOR also substantially outperforms the best baselines by 14% and 19% in terms of Accuracy and F-Measure, respectively.

translated by 谷歌翻译

Conservation Tools: The Next Generation of Engineering--Biology Collaborations

Andrew Schulz , Cassie Shriver , Suzanne Stathatos , Benjamin Seleb , Emily Weigel , Young-Hui Chang , M. Saad Bhamla , David Hu , Joseph R. Mendelson III , .

分类：机器学习

2023-01-03

The recent increase in public and academic interest in preserving biodiversity has led to the growth of the field of conservation technology. This field involves designing and constructing tools that utilize technology to aid in the conservation of wildlife. In this article, we will use case studies to demonstrate the importance of designing conservation tools with human-wildlife interaction in mind and provide a framework for creating successful tools. These case studies include a range of complexities, from simple cat collars to machine learning and game theory methodologies. Our goal is to introduce and inform current and future researchers in the field of conservation technology and provide references for educating the next generation of conservation technologists. Conservation technology not only has the potential to benefit biodiversity but also has broader impacts on fields such as sustainability and environmental protection. By using innovative technologies to address conservation challenges, we can find more effective and efficient solutions to protect and preserve our planet's resources.

translated by 谷歌翻译

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang , David M. Blei , John P. Cunningham

分类： (统计)机器学习 | 机器学习

2023-01-02

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.

translated by 谷歌翻译

Mapping smallholder cashew plantations to inform sustainable tree crop expansion in Benin

Leikun Yin , Rahul Ghosh , Chenxi Lin , David Hale , Christoph Weigl , James Obarowski , Junxiong Zhou , Jessica Till , Xiaowei Jia , Troy Mao

分类：计算机视觉 | 机器学习

2023-01-01

Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model has an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin has doubled from 2015 to 2021 with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas has increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.

translated by 谷歌翻译

Morphology-based non-rigid registration of coronary computed tomography and intravascular images through virtual catheter path optimization

Karim Kadry , Abhishek Karmakar , Andreas Schuh , Kersten Peterson , Michiel Schaap , David Marlevi , Charles Taylor , Elazer Edelman , Farhad Nezami

分类：计算机视觉

2022-12-30

Coronary Computed Tomography Angiography (CCTA) provides information on the presence, extent, and severity of obstructive coronary artery disease. Large-scale clinical studies analyzing CCTA-derived metrics typically require ground-truth validation in the form of high-fidelity 3D intravascular imaging. However, manual rigid alignment of intravascular images to corresponding CCTA images is both time consuming and user-dependent. Moreover, intravascular modalities suffer from several non-rigid motion-induced distortions arising from distortions in the imaging catheter path. To address these issues, we here present a semi-automatic segmentation-based framework for both rigid and non-rigid matching of intravascular images to CCTA images. We formulate the problem in terms of finding the optimal \emph{virtual catheter path} that samples the CCTA data to recapitulate the coronary artery morphology found in the intravascular image. We validate our co-registration framework on a cohort of $n=40$ patients using bifurcation landmarks as ground truth for longitudinal and rotational registration. Our results indicate that our non-rigid registration significantly outperforms other co-registration approaches for luminal bifurcation alignment in both longitudinal (mean mismatch: 3.3 frames) and rotational directions (mean mismatch: 28.6 degrees). By providing a differentiable framework for automatic multi-modal intravascular data fusion, our developed co-registration modules significantly reduces the manual effort required to conduct large-scale multi-modal clinical studies while also providing a solid foundation for the development of machine learning-based co-registration approaches.

translated by 谷歌翻译